Testing for change points in time series 1
نویسندگان
چکیده
June 4, 2010 Xiaofeng Shao and Xianyang Zhang University of Illinois at Urbana-Champaign Abstract: This article considers the CUSUM-based (cumulative sum) test for a change point in a time series. In the case of testing for a mean shift, the traditional KolmogorovSmirnov test statistic involves a consistent long run variance estimator, which is needed to make the limiting null distribution free of nuisance parameters. The commonly used lagwindow type long run variance estimator requires to choose a bandwidth parameter and its selection is a difficult task in practice. The bandwidth that is a fixed function of the sample size (e.g., n, where n is sample size) is not adaptive to the magnitude of the dependence in the series, whereas the data-dependent bandwidth could lead to nonmonotonic power as shown in previous studies. In this article, we propose a self-normalization (SN) based Kolmogorov-Smirnov test, where the formation of the self-normalizer takes the change point alternative into account. The resulting test statistic is asymptotically distribution free and its power is monotonic. Furthermore, we extend the SN-based test to test for a change in other parameters associated with a time series, such as marginal median, autocorrelation at lag one, and spectrum at certain frequency bands. The use of the SN idea thus allows a unified treatment and offers a new perspective to the large literature of change point detection in the time series setting. Monte Carlo simulations are conducted to compare the finite sample performance of the new SN-based test with the traditional Kolmogorov-Smirnov test. Illustrations using real data examples are presented.
منابع مشابه
تحلیل نوسانات بارشهای حوضه آبریز دریاچه ارومیه با روش SMK در دوره آماری 2015-1986
In this study, in order to analyze the trends of annual precipitation, the information from 21 synoptic meteorological stations located in the Urmia Lake basin in a 30-year time period (1986-2015) was used. For this purpose, the Sequential Mann-Kendall test was used. The date of sudden change (if exist) in the precipitation time series of each station was identified. Significance of the trend i...
متن کاملTrend analysis and detection of precipitation fluctuations in arid and semi-arid regions
The most important impacts of climate change relate to temperature and precipitation. Precipitation is particularly important, because changes in precipitation patterns may lead to floods or droughts in different areas. Also, precipitation is a major factor in agriculture and in recent years interest has increased in learning about precipitation variability for periods of months to annual and s...
متن کاملA time series of infectious-like events in Australia between 2000 and 2013 leading to extended periods of increased deaths (all-cause mortality) with possible links to increased hospital medical admissions
Background and aims: Trends in deaths and medical admissions in the UK and Europe show evidence for a series of infectious-like events. These events have been overlooked by traditional surveillance methodologies. Preliminary evidence points to a rise in medical admissions in Australia around the same time as those observed in Europe, and this study was aimed to evaluate whether the deaths are o...
متن کاملBayesian Estimation of the Multiple Change Points in Gamma Process Using X-bar chart
The process personnel always seek the opportunity to improve the processes. One of the essential steps for process improvement is to quickly recognize the starting time or the change point of a process disturbance. Different from the traditional normally distributed assumption for a process, this study considers a process which follows a gamma process. In addition, we consider the possibility o...
متن کاملEnsemble Kernel Learning Model for Prediction of Time Series Based on the Support Vector Regression and Meta Heuristic Search
In this paper, a method for predicting time series is presented. Time series prediction is a process which predicted future system values based on information obtained from past and present data points. Time series prediction models are widely used in various fields of engineering, economics, etc. The main purpose of using different models for time series prediction is to make the forecast with...
متن کاملMissing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کامل